Deep Convolutional Inverse Graphics Network
Tejas D. Kulkarni, William F. Whitney, Pushmeet Kohli, Josh Tenenbaum
This paper presents the Deep Convolution Inverse Graphics Network (DC-IGN), a model that aims to learn an interpretable representation of images, disentangled with respect to three-dimensional scene structure and viewing transformations such as depth rotations and lighting variations. The DC-IGN model is composed of multiple layers of convolution and de-convolution operators and is trained using the Stochastic Gradient Variational Bayes (SGVB) algorithm [10]. We propose a training procedure to encourage neurons in the graphics code layer to represent a specific transformation (e.g. pose or light). Given a single input image, our model can generate new images of the same object with variations in pose and lighting.
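The disentangling idea lends itself to a short illustration. Below is a minimal PyTorch sketch of a conv/deconv VAE trained with SGVB (the reparameterization trick), plus a simplified version of the clamping procedure: in a minibatch where only one scene factor varies, every latent except a designated unit is forced to its batch mean, so the varying factor must be explained by that unit alone. The `ConvVAE` and `clamp_latents` names, all layer sizes, and the single-unit clamping are assumptions for illustration, not the paper's exact architecture or training procedure.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class ConvVAE(nn.Module):
    """Minimal conv/deconv VAE trained with SGVB (reparameterization trick).
    Layer sizes are illustrative, not those of the actual DC-IGN."""
    def __init__(self, z_dim=16):
        super().__init__()
        self.enc = nn.Sequential(
            nn.Conv2d(1, 32, 4, stride=2, padding=1), nn.ReLU(),   # 64 -> 32
            nn.Conv2d(32, 64, 4, stride=2, padding=1), nn.ReLU(),  # 32 -> 16
            nn.Flatten())
        self.fc_mu = nn.Linear(64 * 16 * 16, z_dim)
        self.fc_logvar = nn.Linear(64 * 16 * 16, z_dim)
        self.fc_dec = nn.Linear(z_dim, 64 * 16 * 16)
        self.dec = nn.Sequential(
            nn.ConvTranspose2d(64, 32, 4, stride=2, padding=1), nn.ReLU(),
            nn.ConvTranspose2d(32, 1, 4, stride=2, padding=1))

def clamp_latents(z, active_idx):
    """Simplified DC-IGN-style clamping: in a minibatch where only one scene
    factor varies, replace every latent except `active_idx` with its batch
    mean, so the variation must be captured by the active unit alone."""
    z_clamped = z.mean(dim=0, keepdim=True).expand_as(z).clone()
    z_clamped[:, active_idx] = z[:, active_idx]
    return z_clamped

model = ConvVAE()
opt = torch.optim.Adam(model.parameters(), lr=1e-3)
x = torch.rand(8, 1, 64, 64)   # stand-in minibatch: only one factor varies
h = model.enc(x)
mu, logvar = model.fc_mu(h), model.fc_logvar(h)
z = mu + torch.randn_like(mu) * (0.5 * logvar).exp()   # SGVB sample
z = clamp_latents(z, active_idx=0)  # unit 0 designated for the varying factor
x_hat = model.dec(model.fc_dec(z).view(-1, 64, 16, 16))
recon = F.mse_loss(x_hat, x, reduction='sum')
kl = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
opt.zero_grad(); (recon + kl).backward(); opt.step()
```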
Optimizing Learned Image Compression on Scalar and Entropy-Constraint Quantization
Florian Borzechowski, Michael Schäfer, Heiko Schwarz, Jonathan Pfaff, Detlev Marpe, Thomas Wiegand
Continuous improvements in image compression with variational autoencoders have led to learned codecs competitive with conventional approaches in terms of rate-distortion efficiency. Nonetheless, taking the quantization into account during the training process remains a problem, since it produces zero derivatives almost everywhere and needs to be replaced with a differentiable approximation that allows end-to-end optimization. Though there are different methods for approximating the quantization, none of them model the quantization noise correctly and thus result in suboptimal networks. Hence, we propose an additional finetuning training step: after conventional end-to-end training, parts of the network are retrained on quantized latents obtained at the inference stage. For entropy-constrained quantizers like Trellis-Coded Quantization, the impact of the quantizer is particularly difficult to approximate by rounding or adding noise, as the quantized latents are interdependently chosen through a trellis search based on both the entropy model and a distortion measure. We show that retraining on correctly quantized data consistently yields additional coding gain for both uniform scalar and especially for entropy-constrained quantization, without increasing inference complexity. For the Kodak test set, we obtain average savings between 1% and 2%, and for the Tecnick test set up to 2.2% in terms of Bjøntegaard-Delta bitrate.
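The two-stage idea can be sketched schematically. The toy PyTorch code below contrasts conventional end-to-end training with a differentiable quantization proxy (additive uniform noise) against the proposed finetuning step, in which the encoder is frozen and the decoder is retrained on latents quantized exactly as at inference. The `Codec` class, its layer sizes, the distortion-only loss, and plain rounding are assumptions standing in for the paper's actual networks, entropy model, and trellis-coded quantization; this is a sketch of the general idea, not the authors' implementation.

```python
import torch
import torch.nn as nn
import torch.nn.functional as F

class Codec(nn.Module):
    """Toy autoencoder standing in for a learned image codec; layer sizes
    and names are illustrative, not the paper's networks."""
    def __init__(self):
        super().__init__()
        self.enc = nn.Sequential(
            nn.Conv2d(3, 64, 5, stride=2, padding=2), nn.ReLU(),
            nn.Conv2d(64, 32, 5, stride=2, padding=2))
        self.dec = nn.Sequential(
            nn.ConvTranspose2d(32, 64, 5, stride=2, padding=2, output_padding=1),
            nn.ReLU(),
            nn.ConvTranspose2d(64, 3, 5, stride=2, padding=2, output_padding=1))

codec = Codec()
opt = torch.optim.Adam(codec.parameters(), lr=1e-4)
x = torch.rand(4, 3, 64, 64)

# Stage 1: conventional end-to-end training. Hard rounding has zero
# derivatives almost everywhere, so a differentiable proxy (here: additive
# uniform noise) stands in for the quantizer.
y = codec.enc(x)
y_soft = y + torch.empty_like(y).uniform_(-0.5, 0.5)
loss = F.mse_loss(codec.dec(y_soft), x)   # distortion only; rate term omitted
opt.zero_grad(); loss.backward(); opt.step()

# Stage 2, the finetuning idea (schematically): freeze the encoder, quantize
# the latents exactly as the inference stage would (plain rounding here; the
# paper also covers trellis-coded quantization), and retrain the decoder on
# that correctly quantized data.
for p in codec.enc.parameters():
    p.requires_grad_(False)
opt_dec = torch.optim.Adam(codec.dec.parameters(), lr=1e-5)
with torch.no_grad():
    y_hard = torch.round(codec.enc(x))    # inference-stage quantization
loss = F.mse_loss(codec.dec(y_hard), x)
opt_dec.zero_grad(); loss.backward(); opt_dec.step()
```

Because the hard-quantized latents are produced under `torch.no_grad()`, only the decoder sees gradients in stage 2, which mirrors the paper's point that retraining on correctly quantized data adds coding gain without changing inference complexity.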